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Abstract 


It  is  well  known  that  the  celebrated  Kojiina-Miznno-Yoshise  primal-dual 
interior-point  method  for  linear  programming  can  be  viewed  as  a  damped  per¬ 
turbed  Newton’s  method.  Recently,  Mehrotra  suggested  a  predictor-corrector 
variant  of  this  method.  It  is  currently  the  interior-point  method  of  choice  for 
linear  programming.  The  simplified  Newton  method,  at  the  expense  of  fast 
convergence,  reduces  the  work  required  by  Newton’s  method  by  reusing  the 
initial  Jacobian  matrix.  The  composite  Newton  method  attempts  to  balance 
the  trade-off  between  expense  and  fast  convergence  by  composing  one  Newton 
step  with  one  simplified  Newton  step.  In  this  work  we  demonstrate  that  if  the 
Newton  component  in  the  Kojima-Mizuno-Yoshise  primal-dual  method  is  re¬ 
placed  with  a  composite  Newton  component,  then  the  resulting  method  is  the 
Mehrotra  predictor-corrector  method. 


1  Introduction 

In  subsection  1.1  we  review  the  composite  Newton  method,  in  1.2  we  recall  the  primal- 
dual  interior-point  method,  in  1.3  we  present  Mehrotra’s  predictor-corrector  interior- 
point  method,  and  in  1.4  we  present  our  perturbed  composite  Newton  interior-point 
method.  Section  2  contains  equivalence  results  between  the  Mehrotra  predictor- 
corrector  method  and  the  perturbed  level- 1  composite  Newton  method.  Since  the 
level-1  composite  Newton  method  is  known  to  be  cubically  convergent,  in  Section  3  we 
study  the  cubic  convergence  aspect  of  the  Mehrotra  predictor-corrector  interior-point 
method  via  our  equivalence  result.  It  is  interesting  to  learn  that  the  interior-point  fea¬ 
ture  of  the  method,  i.e.,  the  step  is  damped  so  that  iterates  remain  positive,  precludes 
the  standard  proof  of  cubic  convergence  of  the  method.  However,  for  nondegenerate 
problems  it  is  possible  to  retain  quadratic  convergence.  Recall  that  Zhang,  Tapia  and 
Dennis  (1990)  demonstrated  that  the  primal-dual  interior-point  method  can  attain 
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quadratic  convergence  for  nondegenerate  problems.  We  then  prove  that  by  choosing 
steplength  one  in  a  neighborhood  of  the  solution,  cubic  convergence  can  be  attained 
by  the  predictor-corrector  interior-point  method  for  nondegenerate  problems. 

Numerical  experimentation  with  the  c.ubically  convergent  modification  is  most  im¬ 
pressive  and  has  been  relegated  to  a.  companion  paper,  El-Bakry,  Tapia  and  Zhang 
(1991),  which  numerically  studies  the  local  behavior  of  the  predictor-corrector  algo¬ 
rithm.  Clearly  an  optimal  implementation  of  the  composite  Newton  interior-point 
method  would  allow  m  (the  number  of  simplified  Newton  steps)  to  vary  at  each  New¬ 
ton  step.  This  issue  is  not  the  subject  of  the  current  work,  but  probably  merits  further 
study.  Finally,  in  Section  4  we  give  some  concluding  remarks. 


1.1  The  Composite  Newton  Method 


Consider  the  nonlinear  equation 


F(,;)  =  0 


(1.1) 


where  F  :  IR"  — >  IRn.  By  the  damped  Newton  method  for  problem  (1.1)  we  mean  the 
iterative  process 


solve  F'(xk)( Ax)  =  —F(xk)  for  Arc 

(1.2) 

set  Xk+i  —  Xk  +  a  a- Ax-  ,  k  =  0, 1, ...  . 

The  flexibility  of  being  able  to  choose  «a  less  than  one  is  important  from  global 
convergence  considerations.  When  the  choice  of  steplength  is  «a  =  1  we  drop  the 
qualifier  damped. 

Under  standard  assumptions  Newton’s  method  is  known  to  give  (^-quadratic  con¬ 
vergence.  Not  counting  the  work  required  to  evaluate  the  function  F  or  its  Jacobian, 
the  algebra  required  per  iteration  is  0(n3),  since  the  dominant  task  is  the  factorizing 
of  the  n  x  n  Jacobian  matrix  F'(xk).  For  large  n  this  can  be  a  very  serious  concern. 
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A  particularly  obvious  technique  for  reducing  the  amount  of  algebra  needed  at 
each  iteration  is  given  by  the  damped  simplified  Newton  method 

solve  F'(x0)( Ax)  =  —F(xk)  for  Ax- 
set  Xk- — -  Xk  -}-  cxk  Ax  ,  h  —  0, 1, . . .  . 

The  simplified  Newton  method  requires  an  initial  factorization  of  F'(x o)  and  then 
a  solve  at  each  iteration;  hence  it  requires  only  0(n2)  algebra  per  iteration.  However, 
it  gives  only  Q-linear  convergence  and  it  is  not  at  all  clear  in  what  cases  it  should  be 
preferred  to  Newton’s  method,  since  the  slow  convergence  might  force  a  prohibitive 
number  of  iterations. 

In  an  effort  to  cover  the  middle  ground  between  the  extremes  of  Newton  and 
simplified  Newton  it  is  very  natural  to  consider  the  variant  of  Newton’s  method  which 
takes  m  simplified  Newton  steps  between  every  two  Newton  steps.  By  the  damped 
(level-m)  composite  Newton  method  we  mean  the  iterative  procedure 

solve  F'(xk){Axi)  =  -F(xk  +  A.x0  H - h  Ax^))  for  A xt-  ,  i  =  0, . . . ,  m 

/  x  (1-4) 

set  £'fc+i  —  Xk  +  0(k(Axo  +  Ax'i  +  •  •  •  +  xrn)  ,  ,  A;  —  0,1,...  . 

Of  course  it  is  possible  to  introduce  a  different  steplength  control  akti  for  each  cor¬ 
rection  Ax,-,  i  —  0, . . . ,  m;  however  we  have  no  need  to  consider  such  flexibility. 

It  is  reasonably  well  known  that,  under  the  standard  Newton’s  method  assump¬ 
tions,  the  level-m  composite  Newton  method  has  a  Q- convergence  rate  of  m  +  2.  A 
proof  can  be  found  in  Chapter  10  of  Ortega  and  Rheinboldt  (1970).  The  damped 
level-1  composite  Newton  method  where  one  Newton  step  is  composed  with  one  sim¬ 
plified  Newton  step  is  of  particular  interest  to  us.  It  can  be  written 

solve  F\xk){ Axjv)  =  —F(xk)  for  Axyv 

solve  F'(xk)(Axs)  =  ~F(xk  +  Axn)  for  Axs  (1-5) 

set  xk+1  =  xk  +  ak(AxN  -f  Axs)  ,  k  =  0, 1, ...  . 


(1.3) 
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Ortega  and  Rheinbolclt  (1970)  credit  the  cubic  convergence  of  the  level- 1  composite 
Newton  method  to  Traub  (1964).  However,  the  notion  of  composing  Newton  steps 
with  simplified  Newton  steps  is  much  older  and  a  part  of  the  folklore  of  Newton’s 
method.  It  is  generally  felt  by  practitioners  that  the  formulation  of  composite  Newton 
steps  is  of  value  when  n  is  large  and  the  function  F  can  be  evaluated  cheaply;  this  is 
clearly  the  situation  for  the  primal-dual  interior-point  method  for  linear  programming 
described  in  Subsection  1.2. 

Observe  that  each  level-m  composite  Newton  iterate  can  be  viewed  as  a  major 
iterate  and  is  the  result  of  m  +  1  inner  iterations.  The  average  amount  of  algebra 
per  inner  iteration  is  0((n3  -f  mn2)/  (rn  +  1))  and  is  0(n2)  for  large  m.  The  average 
convergence  rate  for  the  inner  iterates  is  the  (m  -f  l)-st  root  of  m  +  2  and  behaves 
like  1  for  large  m.  It  is  no  surprise  then  that  for  large  m  the  level-m  composite 
Newton  method  behaves  like  the  simplified  Newton  method.  It  follows  that  an  optimal 
implementation  of  composite  Newton  would  not  only  vary  rn  at  each  Newton  step 
but  would  keep  m  relatively  small. 

1.2  The  Primal-Dual  Interior- Point  Method 

Consider  a  linear  program  in  the  standard  form 

minimize  cTx 

subject  to  Ax  =  b  (1-6) 

x  >  0 

where  c,  x  6  IRr\  b  €  IRm,  A  €  IR"tXn  (rn  <  n)  and  A  has  full  rank  m. 

The  first-order  optimality  conditions  for  the  linear  program  (1.6)  can  be  written 

'  Ax  -  b 

F{x,y,  A)  =  AT X  +  y  —  c 
v  XYe 
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0,  {x,y)>  0 


(1.7) 


where  y  6  IRn  and  A  €  IR"‘  are  dual  variables,  X  =  diag(.x),  Y  =  diag(y),  and 

er  =  (i,...,i)  e K". 

The  point  ( x ,  y,  A)  is  said  to  be  feasible  for  problem  (1.7)  if  Ax  =  b,  AT \  +  y—c  =  0, 
and  (x,y)  >  0.  A  feasible  point  (x,y,  A)  is  strictly  feasible  if  ( x,y )  >  0.  We  tacitly 
assume  that  strictly  feasible  points  exist. 

It  is  now  well  understood  how  the  primal-dual  interior-point  method  introduced  by 
Kojima,  Mizuno  and  Yoshise  (1989)  can  be  stated  in  the  framework  of  a  damped  and 
perturbed  Newton’s  method  applied  to  problem  (1.7).  In  presenting  this  algorithmic 
framework  we  will  write  z  =  (;c,  y,  A),  Az  =  (Ax,  Ay,  AA),  AX  =  diag(Az),  and 
AY  =  diag(Ay).  We  also  let  min(u)  denote  the  smallest  component  of  the  vector  u 
and  e  denote  the  vector  (0, . . . ,  0, 1, . . . ,  l)1  where  the  number  of  zeros  is  n  -f  m  and 
the  number  of  ones  is  n. 

Algorithm  1  (Primal-Dual  Interior-Point  Method) 

Given  z0  =  (x0,  ?/0,  A0)  with  (,t0,  y/o)  >  0,  for  k  =  0, 1, . . .,  do 


(1)  Solve  F'(zk)(Az)  =  -F(zk)  for  A zN 

(2)  Choose  fik  >  0  and 

solve  F'(zk)(Az)  =  pfcC  for  A zc 

(3)  Set  Az  =  Azjv  +  A zc 

(4)  Choose  Tfc  G  (0,1)  and  set  ak  =  min  (l,Tkak)  where 

Bt  m111  (  min  (X,-1  Ax)  ’  min  (Yf'Ay) 

(5)  Set  2fr+1  =  +  (viAe. 


(1.8) 


(1.9) 


Actually  in  most  implementations  the  formula  (1.9)  for  ak  is  further  broken  down 
and  one  steplength  is  used  to  update  the  ,T-va.riable  and  another  is  used  to  update 
the  y- variable  and  the  A-variable.  While  this  distinction  is  of  value  in  practice,  it  is 
not  an  issue  in  the  present  work  and  consequently  will  be  ignored. 
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Recently,  under  mild  assumptions,  Zhang,  Tapia  and  Dennis  (1990)  demonstrated 
that  for  nondegenerate  and  degenerate  problems  Q-super linear  convergence  could  be 
attained  by  Algorithm  1  by  merely  letting  crk  —*  0  and  q  — >  1,  where  cq  is  defined 
by  /q  =  o'kx'lyk/n.  Moreover,  for  nondegenerate  problems  (^-quadratic.  convergence 
could  be  attained  by  letting  =  0(x\yk)  and  q.  =  1  +  0(x[.yk).  Zhang  and  Tapia 
(1991)  showed  that  these  results  held  under  wea,ker  assumptions. 

The  Newton  step  A zjq  defined  in  Step  (1)  can  very  likely  point  toward  the  bound¬ 
ary  of  the  positive  orthant,  necessitating  a  very  small  choice  for  the  steplength  a. 
The  major  role  of  the  centering  step  A zc  defined  in  Step  (2)  is  to  remedy  this  situ¬ 
ation.  Hence  it  seems  quite  reasonable  that  the  choice  for  the  centering  parameter 
/q  should  also  be  a  function  of  the  Newton  step  A qy  This  is  particularly  true  in 
delicate  applications.  For  example,  recently  there  was  considerable  speculation  as 
to  whether  an  instance  of  Algorithm  1  could  have  both  polynomial  complexity  and 
superlinear  convergence.  In  their  original  paper  Kojima,  Mizuno  and  Yoshise  (1989) 
presented  choices  for  q  and  /q  leading  to  polynomial  complexity.  Zhang,  Tapia  and 
Dennis  (1990)  presented  conditions  on  q  and  /q  that  guaranteed  superlinear  conver¬ 
gence.  Zhang  and  Tapia.  (1990)  settled  this  concern  by  demonstrating  the  existence 
of  choices  for  q  and  /q  that  guaranteed  both  polynomial  complexity  and  superlinear 
convergence.  Ji,  Potra,,  Tapia,  and  Zhang  (1991)  extended  this  residt  to  linear  com¬ 
plementarity  problems.  In  both  these  applications  the  choice  of  /q  depended  strongly 
on  the  Newton  step  Azjv-  Hence,  the  centering  step  A zc  had  to  be  calculated  as 
described  in  Step  (2)  above.  However,  in  less  delicate  applications  where  /q  depends 
only  on  zk,  Steps  (1)  and  (2)  in  Algorithm  1  can  be  combined  and  the  combined  step 
Az  =  A z/v  +  A zc  can  be  obtained  as  the  solution  of 

F\zk){Az)  =  -F(zk)  +  ,ikek  .  (1.10) 

In  this  way  the  backsolve  required  by  Step  (2)  can  be  saved.  This  is  the  more  com¬ 
mon  presentation  of  the  Kojima, -Mizuno- Yoshise  algorithm  and  is  fine  lor  restricted 
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applications. 


The  sense  in  which  the  primal-dual  interior-point  method  can  be  viewed  as  damped 
perturbed  Newton  should  be  clear.  The  qualifier  damped  speaks  to  the  steplength 
otk  <  1  in  Step  (5).  The  qualifier  perturbed  speaks  to  the  tact  that  the  step  Az 
consists  of  the  Newton  step  A z/v  perturbed  by  the  centering  step  A zc\  see  Step  (3). 


1.3  The  Predictor-Corrector  Interior-Point  Method 

Mizuno,  Todd,  and  Ye  (1989)  suggested  and  studied  an  algorithm  which  they  labeled 
a  predictor-corrector  algorithm.  In  their  algorithm  the  predictor  step  is  a  damped 
Newton  step  for  problem  (1.7),  producing  a  new  strictly  feasible  iterate.  The  subse¬ 
quent  corrector  step  is  a  centered  Newton  step.  In  this  corrector  step,  the  choice  of  //, 
the  centering  parameter,  is  based  on  the  predictor  step.  Both  the  predictor  and  the 
corrector  steps  require  essentially  the  same  amount  of  work,  namely,  the  evaluation 
and  factorization  of  the  Jacobian  matrix. 

Mehrotra  (1989)  later  presented  the  following  variant  of  Algorithm  1,  which  he 
also  referred  to  as  a  predictor-corrector  method.  A  common  feature  in  these  two 
predictor-corrector  approaches  is  that  the  value  of  the  centering  parameter  in  the 
corrector  step  depends  on  the  predictor  step.  However,  unlike  Mizuno,  Todd  and 
Ye’s  corrector  step,  Mehrotra’s  corrector  step  does  not  evaluate  a  fresh  Jacobian 
matrix.  Instead,  it  reuses  the  Jacobian  matrix  used  by  the  predictor  step.  Recall 
that  e  =  (0, . . . ,  0, 1, . . . ,  1)T. 

Algorithm  2  (Predictor-Corrector  Interior-Point  Method) 

Given  z0  =  (x0,  yo,  A0)  with  (x0,?/o)  >  0,  for  k  =  0,1,...  do 

(1)  Solve  F\zk){Az)  =  -F(zk)  for  A zp 
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/ 


0 


(2) 

(3) 

(4) 

(5) 

(6) 


Solve  F'(zk)(Az)  = 


0 

AXpAyp 


for  Azm 


Choose  /ik  >  0  and 

solve  F'(zk)(Az)  =  /ike  for  Azc 
Set  Az  =  A zp  +  A zm  +  A zc 

Choose  Tk  €  (0,1)  and  set  a k  =  min  (l,T*a*)  where 


/ 


-1 


-1 


ak  =  mm 


vminXfc  1Arc)  ’  min(FA.  1A y)/ 
Set  zk+i  =  zk  +  otkAz  . 


(1.11) 


While  in  the  present  section  we  are  not  concerned  with  the  specific  choice  of  the 
initial  iterate  zq  or  the  various  algorithmic  parameters,  we  emphasize  that  Mehrotra 
suggested  choices  that  allowed  him  to  obtain  very  impressive  numerical  results. 


1.4  The  Perturbed  Composite  Newton  Interior-Point  Method 

In  this  subsection  we  present  our  perturbed  composite  Newton  interior-point  method 
for  problem  (1.7).  Recall  that  e  =  (0, . . . ,  0, 1, . . . ,  l)r.  Our  idea  is  to  replace  the  New¬ 
ton  component  in  the  primal-dual  interior-point  algorithm  with  a  composite  Newton 
component. 

Algorithm  3  (Level-m  Perturbed  Composite  Newton  Interior-Point,  Method) 

Given  z0  =  ( x0 ,  ?/u,  A0)  with  (x0,  y0)  >  0  for  k  —  0, 1, ...  ,  do 

(1)  Solve  F'(zk)(Az)  —  —F(zk)  for  A z0 

(2)  For  i  =  1, . . . ,  m  do 

Solve  F\zk){Az)  -  -F(zk  +  ^  Azj)  for  Az,  (1.12) 

j=0 

(3)  Choose  fi,k  >  0  and  solve  F\zk)(Az)  =  /x^e  for  Azc 
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(4) 

(5) 

(6) 


m 

Set  Az  =  ^2  Azi  +  Azc 

i= 0 

Choose  Tk  6  (0,1)  and  set  c\k  =  min  (1  ,rkak) 

.  (  -1  -1 
a*  =  mm  - - - r— — -  ,  - — — 

\min(A’fe  1A.c)  min(Ffc  1A7/) 
Set  zk+ 1  =  zk  +  a.kAz  . 


(l,Tkak)  where 
\ 


2  Predictor-Corrector  as  Perturbed  Composite 
Newton 

We  say  that  two  algorithms  are  equivalent  if  given  a  current  iterate  they  produce 
the  same  subsequent  iterate  for  the  same  choice  of  common  algorithmic  parameters. 

Theorem  2.1  The  predictor-corrector  interior-point  me  thod  (Algorithm  2)  is  equiv¬ 
alent  to  the  level- 1  perturbed  composite  Newton  interior-point  method  (Algorithm  3). 

Proof .  Let  z  =  ( x ,  y,  A)  be  the  current  iterate  and  let  A zp  =  ( Axp,  Ayp,  AXP)  be  the 
predictor  step  for  problem  (1.7),  i.e.,  A zv  is  obtained  from  Step  (1)  of  Algorithm  2. 
By  comparing  Algorithm  2  with  Algorithm  3  (in  =  1),  we  see  that  our  proof  will  be 
complete  once  we  show  that 

(  0  ) 

F(z  +  A  zp)  =  0  •  (2.1) 

^  A  Xj,  A  g  j,  ^ 

Writing  (2.1)  in  further  detail  gives 

A  ( x  -f  Ax.j,)  —  b  =  0 
AT(  A  +  AA?,)  -f-  (y  -f  Ayv)  —  c  =  0 
[a  d-  A.Xy,],'[?y  d-  A //p]*  [ dA ■  c ^ ] v [ Z\ g ,  , 
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i  =  1 ,. . .  ,n  . 


(2.2) 

(2.3) 

(2.4) 


By  expanding  we  see  that  (2.4)  is  equivalent  to 


[y]i[Axp}i  +  [x]i[Ayp]i  =  -[x] ,:[?/],  ,  *  =  1, . . . ,  n  .  (2.5) 

However,  (2.2),  (2.3)  and  (2.5)  a, re  exactly  the  defining  relations  for  the  Newton  step. 
Moreover,  from  the  definition  of  A zv  in  Step  (2)  of  Algorithm  2  it  is  clear  that  A zp 
is  the  Newton  step.  Hence  (2.1)  holds  and  we  have  established  the  equivalence.  □ 

While  in  Mehrotra  (1989)  no  explanation  for  the  predictor-corrector  method  is 
given,  in  a  more  recent  paper,  Mehrotra  (1990),  Mehrotra  offers  an  interpretation 
of  a  related,  but  somewhat  different,  algorithm.  Following  the  lead  of  Monteiro, 
Adler,  and  Resende  (1988)  he  constructs  a  standard  homotopy  in  a  parameter,  say  5, 
between  problem  (1.7)  and  a  problem  which  had  the  current  iteration  as  its  solution. 
The  primal-dual  trajectory  path  parametrized  by  5  gives  the  solution  of  problem  (1.7) 
for  6  =  0  and  the  current  iterate  for  5  =  1.  He  then  views  the  iterate  obtained  from 
the  predictor-corrector  method  as  a  point  on  a  quadratic  path  which  approximates 
the  primal-dual  trajectory  path. 

The  equivalence  represented  by  Theorem  2.1  was  conjectured  while  listening  to 
Mehrotra  discuss  his  predictor-corrector  method  at  the  Second  Asilomar  Workshop  on 
Progress  in  Mathematical  Programming,  Monterey,  California,  February  4-7,  1990. 
After  proving  Theorem  2.1  and  while  preparing  this  paper  we  received  the  paper 
of  Lustig,  Marsten,  and  Shanno  (1990).  In  this  paper  the  authors  describe  a  com¬ 
prehensive  implementation  of  the  Mehrotra  predictor-corrector  method  and  present 
impressive  numerical  results. 

Lustig,  Marsten  and  Shanno  (1990)  motivate  Mehrotra’s  predictor-corrector  method 
in  the  following  manner.  Rather  than  applying  Newton’s  method  to  (1.7)  to  gener¬ 
ate  correction  terms  to  the  current  iterate,  they  consider  F(z  +  A z)  =  fie  directly, 
yielding 

A{x  +  Ax)  =  b  (2.6a) 
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Ar(A  +  AA)  -  (y  +  Ay)  =  c  (2.6b) 

[x  +  Ax]i[y  +  A y]i  -  //.  ,  *  =  1, . . . , «  .  (2.6c) 

Simple  algebra  reduces  (2.6)  to  the  equivalent  system 

A  Ax  =  b  —  Ax  (2.7a) 

ATAX  -  Ay  =  c,  -  AT A  +  y  (2.7b) 

[x]i[A y]i  +  [i/li[Ax],  =  /<  -  [.r]i[?/]i  -  [Ax]t[Ay}i  ,  *  =  n  .  (2.7c) 


Observe  that  (2.7)  defines  the  step  (A.r,  A?/,  AA)  implicitly,  i.e.,  in  a  nonlinear 
manner.  In  order  to  determine  a  step  approximately  satisfying  (2.7)  it  seems  reason¬ 
able  to  first  solve  F(z)  =  0  for  the  Newton  (predictor)  step  (Axp,  A yp,  AXP)  and  then 
use  A xp  and  A yp  on  the  right-hand  side  of  (2.7)  to  solve  for  an  “improved”  step  from 
(2.7)  with  this  modified  right-hand  side. 

It  should  be  clear  that  the  presentation  (2.7),  with  Ax  and  Ay  replaced  by  Axp 
and  A yp  only  on  the  right-hand  side  reflects  the  level- 1  composite  Newton  method 
corresponding  to  an  unperturbed  version  written  in  the  form 

F\zk)Az  =  ~[F(zk)  +  F{zk  -  F'{zk)~1F(zk))\  ;  (2.8) 

while  Mehrotra’s  original  presentation  reflects  form  (1.5). 

Lustig,  Marsten  and  Slianno  (1990)  attempt  an  explanation  of  the  predictor- 
corrector  notion  in  terms  of  trajectories  parametrized  by  the  parameter  fi.  Their 
explanation  contains  some  ambiguity  in  that  it  is  not  clear  to  what  trajectories  they 
are  referring.  Moreover,  any  explanation  based  on  issues  derived  from  //  cannot  give 
a  complete  picture,  since  the  predictor-corrector  notion  still  makes  sense  even  when 
the  problem  formulation  is  free  of  //.,  i.e.  //.  =  0  in  all  cases.  However,  implicit  in  these 
authors’  comments  is  the  understanding  that  the  corrector  step  can  be  viewed  as  a 
perturbed  simplified  Newton  step. 
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3  Cubic  Convergence 


Much  of  the  following  analysis  follows  directly  from  material  in  Dennis  and  Schnabel 
(1983)  or  Ortega  and  Rheinboldt  (1970).  As  before  we  consider  problem  (1.7)  and 
use  the  notation  z  =  (,x,  y,  A).  Also,  recall  that  e  =  (0, . . . ,  0, 1, . . . ,  1)T  where  the 
number  of  zeros  is  n  +  m  and  the  number  of  ones  is  n.  The  pure  Newton  method  can 
be  written 

N(z)  =  z  -  F\z)~lF(z)  (3.1) 

and  the  predictor-corrector  interior-point  method  can  be  written 

N(z)  =  z  -  a.F\z)-l[F{z)  +  F(N(z))  -  fie]  .  (3.2) 


Therefore 

N{z)-z *  =  z  -  z.  -  F\z)~l[F(z)  +  F{N{z))] 

+(1  -  a)F'(z)-l[F(z)  +  F(N(z))}  +  afiF\z)~l e 
=  F'(z)~1[F'(z)(N{z))  -  F(N(z))  -  F'(z)(z, .)] 

+(1  -  a)F'{z)[F(z)  +  F{N{z))}  +  a.f,,F\z)-xe 
=  —F'(z)~1{[F(N(z))  -  F(z .)  -  F\z*){N{z)  -  Zm)]  (3.3) 

+[nO  -  F'(z)](N(z)  -  z.)} 

+(1  -  a)F\z)[F(z)  +  F(N(z))}  +  a,iF'(z)-xe. 

Now,  locally,  i.e.  in  a  neighborhood  of  the  solution  z„,  we  know  from  standard 
Newton’s  method  analysis  that 

||JV(z)-z.||  =  0(||z-z*||2). 

Hence,  we  can  rewrite  the  four  terms  on  the  right-hand  side  in  (3.3)  and  obtain 

||iV(z)  -  z*\\  =  0{\\z  -  Z*||4)  +  0(\\z  -  z,||3)  +  |1  -  a\0(\\z  -  HI)  +  nO(  1)  ; 
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which  simplifies  to 


||iV(z)  -  0*||  =  0(\\z  -  2T*||3)  +  |1  -  «|0(||z  -  0*||)  +  flO(  1)  .  (3.4) 

In  deriving  (3.4)  we  used  the  fact  that  ||F(0)||  =  0(\\z  —  0* ||)  and  ||F(iV(0))||  = 
0(\\z  -  z*\\2). 

The  term  /t0(  1)  can  be  made  0(\\z  —  z*||3)  by  the  choice  of  /<.  Everything  now 
hinges  on  the  term  |1  —  a|0(||z  —  0* ||).  We  must  therefore  take  a  very  close  look  at 
the  quantity  1  —  a.  Clearly,  for  cubic  convergence,  we  need  |1  —  a\  to  be  0(\\z  —  0* ||2). 

Assuming  strict  complementarity,  0*  is  a  nondegenerate  vertex  solution,  and  Zk  is 
feasible.  Zhang,  Tapia  and  Dennis  (1990)  obtained  the  useful  expression 

1  -  at  =  *  ~  ^  (*■*> 

for  the  Newton  interior-point  method.  See  (3.7)  of  Zhang,  Tapia  and  Dennis  (1990). 
In  (3.5),  Tfc  and  are  as  in  Algorithm  1,  €  (£,1]  and  0(xlyk)  is  not  necessarily 

zero  and  is  exactly  first  order.  Observe  that  O(xJijk)  =  O(||0  —  0*||),  since  for  feasible 
zk  we  have  xfajk  =  ||i?(0r-)||i. 

For  the  present  purpose  of  studying  1  —  at ,  the  predictor-corrector  primal-dual 
interior  point  method  and  the  primal-dual  interior-point  method  are  philosophically 
the  same,  i.e.,  both  can  be  viewed  as  perturbed  Newton.  In  the  former  case  the  per¬ 
turbation  to  the  right-hand  side  of  the  defining  relation  is  //,e  —  F(z  —  F'(z)-1  F(z)), 
while  in  the  latter  case  the  perturbation  is  merely  fie.  Observe  that  these  two  pertur¬ 
bation  terms  differ  by  a  term  which  is  order  0(\\z  —  0*||2)  or  equivalently  0((xjyk)2). 
Hence  (3.5)  is  also  valid  for  the  Newton  predictor-corrector  interior-point  method. 
It  can  now  be  seen  from  (3.5)  that,  independent  of  the  choices  for  7y  and  cr*,,  the 
term  |1  —  a^|  is  at  best  O(||0  — z*||)  and  the  Newton  predictor-corrector  interior-point 
method,  even  for  nondegenerate  problems,  cannot  be  shown  to  be  cubically  conver¬ 
gent  by  the  standard  approach.  However,  by  choosing  oa:  =  1  near  the  solution  and 
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pk  —  0{{xlykf)  we  see  from  (3.4)  that  it  is  possible  to  obtain  cubic  convergence.  We 
formally  state  these  observations  as  the  following  theorem. 

Theorem  3.1  Let  { xk ,  Uk,  W}  he.  produced  by  Melirotra’s  predictor- corrector  interior- 
point  method  with  z0  strictly  feasible.  Assume 

(i)  strict  complementarity, 

(ii)  x*  is  a  nondegenerate,  vertex,  and 
(in)  {(xk,yk,Xk)}  converges  to  (x*,  ;</*,  A»). 

If  the  choices  of  erk  and  Tk  satisfy 

0  <  ak  <  minfcr,  Ci(xjyk))  (3.6) 

and 

0  <  rk  <  minfr ,  1  -  e2xj.yk)  (3.7) 

where  a  G  [0,1),  r  G  (0,1)  and  c\,c2  >  0,  then  the  convergence  is  Q-quadratic,  i.e. 
there  exist  72  >  0  such  that  for  k  large 

||  (x'fc+1  1  IJk+l  i  Afc+1  )  //*,  A*)  ||  ^  7'2  II i'^ki  !lki  ^k)  (x*,  //*,  A*)  || 

On  the  other  hand,  if  instead  of  (3.(i)  we  have 

0  <  vk  <  min((T,Ci(x{.yky)  (3.8) 

and  instead  of  (3.7)  we  have  that  for  large  k 

ak  =  1  ,  (3.9) 

then  the  convergence  is  Q -cubic,  i.e.  there  exist  73  >  0  such  that  for  k  large 

|| (x'fc+i ,  Z/fc+i >  W+i )  (x*,  ?/»,  A*) ||  G:  7.'?||(xfc,  Vki  Afc)  (x*,  y*i  A*) ||  • 

Proof.  The  proof  follows  from  combining  the  discussion  given  above  with  the  details 
given  in  Zhang,  Tapia,  and  Dennis  (1990)  for  the  proof  of  Theorem  4.1.  □ 
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4  Concluding  Remarks 


In  this  paper  we  have  studied  the  Mehrotra  predictor-corrector  philosophy  and  demon¬ 
strated  that  it  is  equivalent  to  the  level- 1  perturbed  composite  Newton  philosophy. 

We  were  intrigued  by  the  discovery  that,  while  the  level-1  composite  Newton 
method  is  known  to  be  cubically  convergent,  this  standard  convergence  rate  proof 
applied  to  the  predictor-corrector  interior-point  method  gives  at  best  quadratic  con¬ 
vergence.  The  limitation  of  the  standard  proof  results  from  the  constrictive  steplength 
choice  forced  on  the  method  by  the  interior  point  philosophy,  i.e.,  requiring  the  it¬ 
erates  to  remain  strictly  feasible  with  respect  to  the  nonnegativity  constraints.  We 
demonstrated  that  if  one  drops  the  interior-point  aspect  of  the  predictor-corrector 
method  locally,  i.e.,  in  a  neighborhood  of  the  solution  steplength  one  is  selected,  and 
also  chooses  the  centering  parameter  to  be  of  the  order  of  the  duality  gap  cubed,  then 
cubic  convergence  can  be  attained  for  nondegenerate  problems. 

The  research  presented  in  Zhang,  Tapia,  and  Dennis  (1990),  in  Zhang,  Tapia 
and  Potra  (1990),  and  the  present  research  leads  us  to  conjecture  that  we  should 
implement  Newton  interior-point  methods  and  their  variants  in  a  manner  which  near 
the  solution  sets  the  centering  parameter  to  zero  and  takes  steplength  one,  i.e.,  as 
old-fashioned  Newton.  Our  preliminary  numerical  experiments  employing  this  idea 
were  impressive  and  motivated  the  more  general  study  described  in  the  companion 
paper  El-Bakry,  Tapia  and  Zhang  (1991).  The  reader  is  referred  to  that  paper  lor 
numerical  results. 

Acknowledgement.  The  authors  thank  an  anonymous  referee  lor  critical  com¬ 
ments  that  forced  them  to  give  a.  more  consistent  treatment  ol  various  conceptual 
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